Adaptive Look-Ahead Planning

نویسندگان

  • Sebastian Thrun
  • Knut Möller
  • Alexander Linden
چکیده

We present a new adaptive connectionist planning method. By interaction with an environment a world model is progressively constructed using the backpropagation learning algorithm. The planner constructs a look-ahead plan by iteratively using this model to predict future reinforcements. Future reinforcement is maximized to derive suboptimal plans, thus determining good actions directly from the knowledge of the model network (strategic level). This is done by gradient descent in action space. The problem of nding good initial plans is solved by the use of an \experience" network (intuition level). The appropriateness of this planning method for nding suboptimal actions in unknown environments is demonstrated with a target tracking problem.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diagnosis and Repair Iteration Planning versus N-Step Look Ahead Planning

In this paper we compare the performance of planning algorithms distinguishing and iterating between observation (diagnosis) and repair (action) phases with algorithms extending conventional planning methods with observations using n-step look ahead. Diagnosis and repair iteration planning algorithms are an extension of earlier work of Friedrich and Nejdl, n-step look ahead planning including d...

متن کامل

Pipelined adaptive IIR filter architectures using scattered and relaxed look-ahead transformations

Fine-grain pipelined architectures for adaptive infinite impulse response (AIIR) filters are presented in this paper. The AIIR filters are equation error based. The proposed architectures are developed by employing a combination of scattered look-ahead and relaxed look-ahead pipelining techniques. First, a pipelined system identification scenario is developed. Then, the scattered look-ahead tec...

متن کامل

Effect of driving experience on anticipatory look-ahead fixations in real curve driving.

Anticipatory skills are a potential factor for novice drivers' curve accidents. Behavioural data show that steering and speed regulation are affected by forward planning of the trajectory. When approaching a curve, the relevant visual information for online steering control and for planning is located at different eccentricities, creating a need to disengage the gaze from the guidance of steeri...

متن کامل

Annihilation-reordering look-ahead pipelined CORDIC-based RLS adaptive filters and their application to adaptive beamforming

The novel annihilation-reordering look-ahead technique is proposed as an attractive technique for pipelining of Givens rotation (or CORDIC) based adaptive lters. Unlike the existing relaxed look-ahead, the annihilation-reordering look-ahead does not depend on the statistical properties of the input samples. It is an exact look-ahead and based on CORDIC arithmetic, which is known to be numerical...

متن کامل

Movement-Based Look-Ahead Traffic-Adaptive Intersection Control ⋆

There exist several control approaches for traffic signal control such as fixed-time, vehicle-actuated, or look-ahead traffic-adaptive control. We argue that in order to flexibly deal with varying demand levels movement-based control (which is already common in vehicleactuated intersection control) is required instead of stage-based control (which is still employed in the state-of-the-art in lo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990